Convergence Of The Iterated Prisoner's Dilemma Game

Authors

  • Martin E. Dyer
  • Leslie Ann Goldberg
  • Catherine S. Greenhill
  • Gabriel Istrate
  • Mark Jerrum
Abstract

Co-learning is a model in which agents from a large population interact by playing a fixed game and update their behaviour based on previous experience and the outcome of this game. The Highest Cumulative Reward rule is an update rule which ensures the emergence of cooperation in a population of agents without centralized control, for various games and interaction topologies. We analyse the convergence rate of this rule when applied to the Iterated Prisoner's Dilemma game, proving that the convergence rate is optimal when the interaction topology is a cycle and exponential when it is a complete graph.
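As a rough illustration of the kind of process the abstract describes, the sketch below simulates agents on a cycle who repeatedly play a neighbour and then update. The abstract does not spell out the update rule, so this assumes a memory-one, Pavlov-style (win-stay, lose-shift) update as a stand-in: after a round, both players cooperate if they made the same move and defect otherwise. The function names, the uniform random choice of edge, and the random initial states are all hypothetical choices made for this example, not details taken from the paper.

```python
import random


def pavlov_step(states, i, j):
    """Players i and j play one round and update in place (True = cooperate).

    Assumed win-stay, lose-shift update: matching moves lead both players
    to cooperate next time, mismatched moves lead both to defect.
    """
    same = states[i] == states[j]
    states[i] = states[j] = same


def rounds_until_all_cooperate(n, rng, max_rounds=1_000_000):
    """Count rounds until every agent on an n-cycle cooperates.

    Returns None if all-cooperate is not reached within max_rounds.
    """
    # Assumption: each agent starts as cooperator or defector with equal odds.
    states = [rng.random() < 0.5 for _ in range(n)]
    for t in range(max_rounds):
        if all(states):
            return t
        # Assumption: pick one edge (i, i+1) of the cycle uniformly at random.
        i = rng.randrange(n)
        pavlov_step(states, i, (i + 1) % n)
    return None


if __name__ == "__main__":
    rng = random.Random(0)
    for n in (8, 16, 32):
        print(n, rounds_until_all_cooperate(n, rng))
```

Under this assumed update the all-cooperate state is absorbing, which is why the loop can stop as soon as it is reached. Swapping the neighbour choice `(i + 1) % n` for a uniformly random other agent gives the complete-graph variant, which can be expected to take far longer to settle as `n` grows.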

Similar resources

Evolutionary stability in the n-person iterated prisoner's dilemma.

The iterated prisoner's dilemma game has been used extensively in the study of the evolution of cooperative behaviours in social and biological systems. The concept of evolutionary stability provides a useful tool to analyse strategies for playing the game. Most results on evolutionary stability, however, are based on the 2-person iterated prisoner's dilemma game. This paper extends the results...

Evolutionary Stability in the N-Person

The iterated prisoner's dilemma game has been used extensively in the study of the evolution of cooperative behaviours in social and biological systems. The concept of evolutionary stability provides a useful tool to analyse strategies for playing the game. Most results on evolutionary stability, however, are based on the 2-person iterated prisoner's dilemma game. This paper extends the results...

A Cybernetic Perspective on the Role of Noise in the Iterated Prisoner's Dilemma

An interpretation of the evolution of complexity in the Iterated Prisoner's Dilemma (IPD) is developed, based on Ashby's "law of requisite variety". It is demonstrated that the influence of noise on the evolutionary dynamics of this system is critically dependent on the locus of this noise. It is also argued that noise in such an evolving system is not merely (or necessarily) a source of variat...

A Trust Model of E-commerce Based on Iterated Prisoner's Dilemma Game

Along with fierce e-commerce market competitions, some sellers may be worried about losing their customers so they bribe advisors by material means. The behavior of the advisor not only depends on their intrinsic properties, but also depends on their motivation that they may provide untruthful information to obtain additional material reward. The balance between profit and information truth con...

The Iterated Prisoner's Dilemma on a Cycle

Pavlov, a well-known strategy in game theory, has been shown to have some advantages in the Iterated Prisoner’s Dilemma (IPD) game. However, this strategy can be exploited by inveterate defectors. We modify this strategy to mitigate the exploitation. We call the resulting strategy Rational Pavlov. This has a parameter p which measures the “degree of forgiveness” of the players. We study the evo...

Journal:
  • Combinatorics, Probability & Computing

Volume 11

Publication date: 2002